OPEN 3 ACCESS Freely available online 



•0-PLOS I o-^E 



Soluble Prokaryotic Overexpression and Purification of 
Bioactive Human Granulocyte Colony-Stimulating Factor 
by Maltose Binding Protein and Protein Disulfide 
Isomerase 

Bich Hang Do^ Han-Bong Ryu^ Phuong Hoang, Bon-Kyung Koo, Han Choe* 

Department of Physiology and Biomedical Institute of Technology, University of Ulsan College of Medicine, Seoul, Korea 



Abstract 

Human granulocyte colony-stimulating factor (liGCSF), a neutropliil-promoting cytol<ine, is an effective therapeutic agent 
for neutropenia patients wlio have undergone several cancer treatments. Efficient production of hGCSF using £ coli is 
challenging because the hormone tends to aggregate and forms inclusion bodies. This study examined the ability of seven 
different N-terminal fusion tags to increase expression of soluble hGCSF in £ coli. Four tag proteins, namely maltose-binding 
protein (MBP), N-utilization substance protein A, protein disulfide isomerase (PDI), and the b'a' domain of PDI (PDIb'a'), 
increased the solubility of hGCSF under normal conditions. Lowering the expression temperature from 30°C to 18°C also 
increased the solubility of thioredoxin-tagged and glutathione S-transferase-tagged hGCSF. By contrast, hexahistidine- 
tagged hGCSF was insoluble at both temperatures. Simple conventional chromatographic methods were used to purify 
hGCSF from the overexpressed PDIb'a'-hGCSF and MBP-hGCSF proteins. In total, 11.3 mg or 10.2 mg of pure hGCSF were 
obtained from 500 mL cultures of £ coli expressing PDIb'a'-hGCSF or IVlBP-hGCSF, respectively. SDS-PAGE analysis and silver 
staining confirmed high purity of the isolated hGCSF proteins, and the endotoxin levels were less than 0.05 E\J/\ig of 
protein. Subsequently, the bioactivity of the purified hGCSF proteins similar to that of the commercially available hGCSF was 
confirmed using the mouse l\/l-NFS-60 myelogenous leukemia cell line. The ECsqS of the cell proliferation dose-response 
curves for hGCSF proteins purified from MBP-hGCSF and PDIb'a'-hGCSF were 2.83±0.31 pM, and 3.38±0.41 pM, 
respectively. In summary, this study describes an efficient method for the soluble overexpression and purification of 
bioactive hGCSF in £ coli. 

Citation: Do BH, Ryu H-B, Hoang P, Koo B-K, Choe H (2014) Soluble Prokaryotic Overexpression and Purification of Bioactive Human Granulocyte Colony- 
Stimulating Factor by Maltose Binding Protein and Protein Disulfide Isomerase. PLoS ONE 9(3): e89906. doi:10.1371/journal.pone.0089906 

Editor: Paul D. Riggs, New England BioLabs, United States of America 

Received October 10, 2013; Accepted January 24, 2014; PublisKied March 3, 2014 

Copyright: © 2014 Do et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted 
use, distribution, and reproduction in any medium, provided the original author and source are credited. 

Funding: This work was supported by MRC grant (2008-0062286) and Priority Research Center Program (2009-0094054) of National Research Foundation funded 
by the Ministry of Education, Science and Technology of Korea. The funders had no role in study design, data collection and analysis, decision to publish, or 
preparation of the manuscript. 

Competing interests: The authors have applied for a Korean patent on the method of producing hGCSF in E. coli: The patent titled 'Soluble expression and 
purification method of active recombinant human GCSF' has been filed at the Korea Patent Office on Sept. 26, 201 3. The patent application number is '1 0-201 3- 
0114432'. This patent application does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials. Therefore all materials 
presented in the manuscript will be freely available to the scientific community although not for commercial purposes. 

* E-mail: hchoe@ulsan.ac.kr 

(9 These authors contributed equally to this work. 



Introduction 

Granulocyte colony-stimulating factor (GCSF), also known as 
pluripoietin, controls the production, differentiation, and function 
of granulocytes, which account for 70% of white blood cells [1,2]. 
The recruitment of two monomers of GCSF triggers dimerization 
of the GCSF receptor and initiates a signaling cascade [3-5]. 
Production of GCSF, which is secreted predominantly by 
macrophages, fibroblasts and endothelial cells, is stimulated by 
several inflammatory stimuli, including interleukin- 1 [5, tumor 
necrosis factor-alpha, and hpopolysaccharide [6-8]. Human 
GCSF (hGCSF) has been approved for the treatment of 
neutropenia, a common disorder in cancer patients following 
radiotherapy or chemotherapy treatments, characterized by an 
extremely low number of neutrophils in the blood [9,10]. GCSF 
also has neuroprotective properties [11]; accordingly, the protein 



has been used as a protective agent in mouse models of various 
neurodegenerative diseases, including amyotrophic lateral sclerosis 
[12,13]. 

Human GCSF was initially purified from a tumor cell line that 
continuously secreted the protein [14]. When expressed in the 
methylotrophic yeast Pichia pastoris, hGCSF is secreted in a soluble 
form; however, the secreted protein is highly aggregated and must 
be solubUized using high concentrations of denaturants such as 
guanidine hydrochloride or urea. Consequendy, purification of the 
biologically active form of hGSCF from yeast requires the removal 
of these denaturants and refolding of the protein [15]. Escherichia 
coli also produces aggregated hGCSF in inclusion bodies (IBs) [16- 
22]; however, the overall yield of biologically active protein from 
these structures is usually low [23]. Alternatively, hGCSF can be 
secreted into the periplasm of E. coli [24,25], although low yields 
are also usually obtained using this method. Maltose-binding 
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protein (MBP), and stress-responsive proteins such as peptidyl- 
prolyl cis-trans isomerase B, bacterioferritin, and glutathione 

synthase, have previously been tested as fusion partners to increase 
the production of solubUized hGCSF in E. coli [26,27]. 

In this study, several new methods of overexpressing soluble 
hGCSF in the cytoplasm of E. coli were investigated, enabling 
efEcient production of biologically active protein. The following 
seven N-terminal fusion tags were used: hexahistidine (His6), 
thioredoxin (Trx), glutathione S-transferase (GST), MBP, N- 
utilization substance protein A (NusA), protein disulfide bond 
isomerase (PDI), and the b'a' domain of PDI (PDIb'a'). The MBP, 
NusA, PDI, and PDIb'a' tags increased the solubility of hGCSF 
markedly at 30°C. Lowering the expression temperature to 18°C 
also increased the solubility of Trx- and GST-tagged hGCSF, 
whereas His6-hGCSF was insoluble at both temperatures. The 
expression level and the solubility of the tag-fused hGCSFs were 
also tested in the E. coli Origami 2(DE3) strain that have mutations 
in both the thioredoxin reductase (trxB) and glutathione reductase 
(gor) genes, which may assist the disulfide bond formation in the 
cytoplasm of i?. coli [28-30]. Simple methods of purifying hGCSF 
from the PDIb'a' or MBP tagged proteins were developed using 
conventional chromatographic techniques. In total, 11.3 mg of 
biologically active hGCSF was obtained from 500 mL of culture. 
Silver staining indicated that the extracted hGCSF was highly 
pure and the endotoxin level was very low. The activity of the 
purified protein was measured using a bioassay with mouse M- 
NFS-60 myelogenous leukemia cells. 

Materials and Methods 

Construction of plasmids and expression in E. coli 

The hGCSF gene (Uniprot identifier: P09919-2) encodes a 
protein comprising 204 amino acids, the first 29 of which form the 
signal peptide. To enable the expression and purification of 
hGCSF in E. coli, a tobacco etch virus (TEV) protease recognition 
site (TEVrs; ENLYFQG) was appended to the N-terminus of 
mature hGCSF (175 amino acids), and two site-specific recombi- 
nation sequences, attBj (5'-GGGGACAAGTTTGTA- 
CAAAAAAGCAGGCTTC-3') and attBa (5'-AC- 
CCAGCTTTCTTGTACAAAGTGGTCCCC-3'), were added 
to each end of the gene sequence (Figures lA and IB). The 
hGCSF DNA sequence which is substituted Met' to Ala' was 
synthesized and subcloned into plasmid pUC57 (Genscript, 
Piscataway, NJ), which was then recombined with the pDO- 
NOR207 vector (Invitrogen, Carlsbad, CA) to produce the entry 
vector pENTR-hGCSF (Figure lA). LR recombination cloning 
between pENTR-hGCSF and seven destination vectors containing 
the relevant fusion tags (pDEST-HGWA, pDEST-HXGWA, 
pDEST-HGGWA, pDEST-HMGWA, pDEST-HNGWA, 
pDEST-PDI, and pDEST-PDIb'a') [31,32] was performed to 
produce expression vectors containing tagged hGCSF. The 
expression plasmids were confirmed by DNA sequencing (Macro- 
gen, Daejeon, Korea) and then transformed into E. cofi BL21(DE3) 
and Origami 2(DE3). 

To overexpress hGCSF, the transformed BL21(DE3) cells were 
grown at 37°C in 200 rpm of shaking incubator in 2 mL of Luria- 
Bertani (LB) broth containing 50 |a,g/mL ampiciUin. For the 
culture of the transformed Origami 2(DE3), 12.5 |Xg/mL tetracy- 
cline was also added. One mM isopropyl-|3-D-thiogalactoside 
(IPTG) was added at 0.4~0.6 OD^oo to induce the expression of 
the hGCSF fusion proteins. The cells were harvested after 
incubation for 5 h at 30°C or 12 h at 18°C. 



Purification of hGCSF from the PDIb'a'-hGCSF fusion 

protein 

E. coll BL21(DE3) cells transformed with the PDIb'a'-hGCSF 
expression vector were cultured for 12 h at IH^C in 500 mL of LB 
medium. When ODgoo was reached to 0.4~0.6, 1 mM IPTG was 
added to induce the expression of the fusion protein. The collected 
cells were resuspended in 50 mL of immobilized metal ion affmity 
chromatography (IMAC) binding buffer comprising 50 mM Tris- 
HCl (pH 8.0), 500 mM NaCl, and 5% glycerol (v/v). The solution 
was sonicated until completely transparent and then centrifuged 
for 20 min at 27,000 g to generate the supernatant. After 
eriuilibrating with binding buffer, the pre-packed 3x5 mL Hi- 
sTrap HP column (GE Healthcare, Pisc:ataway, NJ) was fed with 
the lysate solution and non-specific proteins were then removed by 
washing with IMAC buffer containing 100 mM imidazole. The 
PDIb'a'-hGCSF flision protein was eluted in IMAC buffer 
containing 500 mM imidazole. To support TEV protease 
cleavage, the buffer was then exchanged to NaCl-free IMAC 
buffer (50 mM Tris-HCl, pH 8.0, 5% glycerol (v/v)) using a 
dialysis membrane (Viskase, Darien, Illinois). For digestion, the 
fusion protein was incubated with TEV protease at a ratio of 1:20 
for 12 h at 18°C. For IMAC, the digested sample was loaded onto 
a pre-packed 2x5 mL HisTrap HP column filled with IMAC 
buff('r. Unlike other proteins in solution, hGCSF had a low affmity 
to the Ni resin and was easily eluted from the HisTrap column 
using IMAC buffer containing 50 mM imidazole. Based on the 
chromatogram, the collected hGCSF was analyzed by 10% Tris- 
tricine SDS-PAGE. 

Purification of hGCSF from the MBP-hGCSF fusion protein 

E. coli BL21(DE3) cells transformed with the MBP-hGCSF 
expression vector were cultured for 12 h at 18°C in 500 mL of LB 
medium and induced by 1 mM IPTG when ODfioo was 0.4~0.6. 
Due to the high affinity of MBP-hGCSF to the MBP column, a 
2x5 mL MBPTrap HP column (GE Healthcare) was used as the 
first purification step. The cells were resuspended in 50 mL of 
MBP-binding buffer comprising 50 mM Tris-HCl (pH 8.0), 
0.5 mM EDTA, 200 mM NaCl, and 5% glycerol (v/v), and then 
sonicated to form a soluble solution. The supernatant was loaded 
onto a 2x5 mL MBPTrap HP column equilibrated with MBP- 
binding buffer. Non-specific bound proteins were removed by 
washing with binding buffer and MBP-hGCSF was eluted with 
binding buffer containing 10 mM maltose monohydrate. The 
eluted sample was diluted until the final concentration of NaCl 
was 50 mM and then cleaved with TEV protease under the same 
conditions as described for PDIb'a'-hGCSF. Cleaved hGCSF was 
then purified using the same method of hGCSF cleavage from 
PDIb'a'-hGCSF. 

SDS-PAGE and silver staining 

Proteins were separated and visualized on a 1 0 "/« Tris-tricine gel 
stained with Coomassie Brilliant Blue R-250 (AMRESCO, Solon, 
OH). The expression, solubility, and purity were quantified using 
ImageJ software (http://imagej.nih.gov/ij). For silver staining, the 
polyacrylamide gel was placed into Fixative Enhancer Solution 
(Bio-Rad Laboratories, Hercules, CA) for 20 min and then rinsed 
with distilled water to increase the sensitivity and contrast of the 
staining. Staining and developing were performed using a mixture 
of silver complex solution, reduction moderator solution, and 
image development reagent (Bio-Rad Laboratories, Hercules, 
CA). The reaction was stopped by the addition of 5% acetic acid. 
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Figure 1. Construction of tKie hGCSF expression vectors and schematic representations of the domain structures. A. The method of 
construction and vector map of the tag-hGCSF construct. All fusion constructs were generated in the same way via LR recombination cloning. 
Expression of the fusion proteins in £ coli was controlled by the IPTG-inducible T7 promoter, and ampicillin was used as the selection marker. B. 
Schematic representation of the seven hGCSF fusion proteins used in this study (His6-, Trx-, GST-, PDIb'a'-, MBP-, PDI-, and NusA-hGCSF). The arrow 
indicates the TEV protease cleavage site. Black is extra sequences from BP and LR recombinations. The amino acid sequence of mature hGCSF is also 
shown. 

doi:10.1371/journal.pone.0089906.g001 



Endotoxin assay 

To remove endotoxiiis from purified hGCSF, the solution was 
incubated with 1% Triton X-1 14 (Sigma-Aldrich, St. Louis, MO) 
at 4°C for 30 min. Triton X-1 14 was accumulated after 
incubating the sample at room temperature and removed by 
centrifugation at 9,000 g for 10 min [33]. The Endpoint 
Chromogenic Limulus Amebocyte Lysate test (Lonza, Basel, 
Switzerland) was used to quantify the remaining endotoxin in 
the target solution. Briefly, Limulus Amebocyte Lysate was 
incubated with the hGCSF sample at 37°C for 10 min before 
the substrate was added. Stop agent (25% v/v glacial acetic acid) 
was then added to the mixture and the released p-nitroaniline was 
evaluated by photometric measurement at 405-4 1 0 nm. 

Cell proliferation assay 

The M-NFS-60 mouse myelogenous leukemia cell line [34,35], 
kindly provided by Dr. Kyung-Woon Kim (Rural Development 
Administration, Suwon, Korea), was grown in RPMT1640 
medium (Invitrogen, Carlsbad, CA) containing 10% fetal bovine 
serum, IX penicillin and streptomycin (Invitrogen), and 0.05 mM 
P-mercaptoethanol [36]. The cells were maintained at 37°C in a 
humidified atmosphere containing 5% CO2. The bioassay of 
purified hGCSF using M-NFS-60 cells was based on the 3-(4,5- 
dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide (MTT) as- 
say (Invitrogen). The cultured cells were seeded at a density of 
3x10* ceUs/weU into 96-weU plates containing growth medium. 
To determine its effect on proliferation of the cells, dififerent 
concentrations (0.1, 1, 10, 100, 1,000, 10,000 and 100,000 pg/ 
mL) of commercially available hGCSF purified from IB (Gen- 
script, Piscataway, NJ) and hGCSF produced from the PDIb'a' 
and MBP fusion proteins were added to each well in a final volume 
of 100 |xL. After 72 h of incubation, 15 ^L of 5 mg/mL MTT was 



added to each well and the cells were incubated in the dark at 
37°C for a further 4 h. After removing all solutions from the cells, 
100 |a.L of dimethyl sulfoxide was added to each well to completely 
solubUize the formed aggregates. The optical density of the 
solution was measured at 570 nm using an ELISA plate reader 
(Molecular Devices, Sunnyvale, CA). 

Data analysis 

A non-linear regression analysis was used to determine the M- 
NFS-60 cell proliferation dose-response to hGCSF. The data were 
fitted using the following equation and Microsoft Excel software, 
where Re is response of the cells, Bl is the baseline at low 
concentration, Max is the maximum response, cone is the 
concentration of the protein, and Hs is the HiU coefficient of 
stimulation, Bh is the baseline at high concentration, and Hi is the 
HiU coefficient of inhibition: 



Re = Bl + 



Max-Bl 



\ + 



\conc 



Max—Eh 

\conc J 



(1) 



All data are presented as the mean ± standard error (SE) of 
nS3 of 2 independent experiments. To determine the statistical 
significance of the responses of cells to hGCSF, group means were 
compared using a Student's Z-test or a one-way analysis of variance 
followed by Bonferroni's multiple comparisons test. Graphpad 
Prism 5 software (GraphPad, San Diego, CA) was used for 
statistical analyses and P<0.05 was considered significant. 
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Results 

Construction of plasmids and expression of tagged 
hGCSF in E. coli 

To enable soluble expression of hGCSF in the cytoplasm of E. 
coli, the following seven tags were fused to the N-terminus of the 
protein via LR recombination cloning: His6, Trx, GST, PDI b'a', 
MBP, PDI, and NusA (Figure 1). A TEVrs was also inserted 
between each tag and hGCSF to facilitate removal of the tags 
during purification, and the sccjucncc was codon-optimized for E. 
coll expression (Figure* IB). \'cct()rs containing the fusion tags were 
recombined with the hGCSF plasmid, then the resulting plasmids 
were sequence-verified and transformed into the BL21(DE3) E. coli 
strain, which lacks protease expression. 

Expression of the hGCSF fusion genes in E. coli was controlled 
by a T7 promoter and induced with 1 mM IPTG at two different 
expression temperatures of 30°C and 18°C. The expression levels 
of all tagged hGCSF proteins were 3 3-68 "/o, and the expression 
levels of all proteins were higher at 18°C than 30°C (Figure 2 and 
Table 1). The solubilities of the proteins varied depending on both 
the type of fusion tag used and the expression temperature. The 
solubility of hGCSF at 30°C was markedly enhanced by the 
addition of the MBP, NusA, PDI, and PDIb'a' tags (Figure 2B and 
Table 1). Lowering the expression temperature to 18°C addition- 
ally increased the solubility of the Trx-hGCSF and GST-hGCSF 
proteins to similar levels (Figure 2A and Table 1); however, His6- 
hGCSF was insoluble at both expression temperatures. We also 
tested E. coli Origami 2(DE3), a strain that may promote disulfide 
bond formation in the cytoplasm of E. coli, as an expression host. 
The expression levels of the fusion proteins in Origami 2(DE3) 
were lower than those in BL21(DE3), and the solubilities were 
similar at both 18°C and 30°C (Figure 81). Based on the 
expression level, solubilities and sizes of the tagged proteins, 
PDIb'a'-hGCSF and MBP-hGCSF in BL21(DE3) were selected 
for further study. 

Purification of hGCSF from the PDIb'a'-hGCSF fusion 
protein 

Separation of hGCSF from the PDIb'a'-hGCSF fusion protein 
was performed by two rounds of IMAC, with an intervening TEV 
protease digestion step (Figure 3A). IMAC was possible because all 
of the tags used in the study contained an additional His6 or His8 
tag at their N-terminal end (Figure IB). Cells transformed with the 
plasmid containing PDIb'a'-hGCSF were induced with IPTG and 
then collected (Figure 3B, lane 2). The cells were lysed and 
centrifuged to harvest the supernatant (Figure 3B, lane 3), which 
was then loaded onto a Ni column and the binding protein was 
eluted after a washing step (Figure 3B, lane 4). Most of the non- 
specific proteins were removed at this step; however, some minor 
contaminant bands were observed. Despite the presence of these 
additional proteins, TEV protease digestion was performed. After 
optimizing the digestion conditions (data not shown), the majority 
of the PDIb'a'-hGCSF protein was cleaved by TEV protease 
(Figure 3B, lane 5). A second HisTrap HP column was then used 
to remove the PDIb'a' tag, undigested PDIb'a'-hGCSF, and TEV 
protease, which also contained a His6-tag. Cleaved hGCSF weakly 
bound to the Ni column and was eluted liy .^0 mM imidazole 
(Figure 3C). An SDS-PAGE analysis revealed the absence of any 
contaminating proteins after this step (Figure 3B, lane 6). Silver 
staining of the SDS-PAGE gel under reducing and non-reducing 
conditions showed that the purified hGCSF protein was highly 
pure and mostiy monomeric (Figure 3D). Typically, 11.3 mg of 
hGCSF was obtained from a 500 mL culture of E. coli expressing 
PDIb'a'-hGCSF, with a yield of 36.7% (Table 2). After treatment 



with Triton X-114, the endotoxin level of hGCSF purified from 
the PDIb'a'-hGCSF fusion protein was 0.05 EU/ng. 

Purification of hGCSF from the IVlBP-hGCSF fusion protein 

Figure 4A shows an outline of the process used to purify hGCSF 
from MBP-hGCSF in the cell lysate. MBP chromatography 
isolated the MBP-hGCSF fusion protein from the total protein 
mixture with a purity of approximately 80% (Figure 4B, lane 4). 
After cleavage of the fusion protein with TEV protease (Figure 4B, 
lane 5), the sample was applied to a Ni-NTA column and purified 
hGCSF was obtained by eluting with 50 mM imidazole (Figure 4B, 
lane 6; Figure 4C). Similar to the highly pure hGCSF (approx- 
imately 99%) obtained from PDIb'a'-hGCSF, silver staining of the 
SDS-PAGE gel under reducing and non-reducing conditions 
revealed the presence of highly pure hGCSF isolated from MBP- 
hGCSF (Figure 4D). Most of the purified protein was monomeric; 
although a small amount of hGCSF dimer was observed under 
non-reducing conditions (Figure 4D). Typically, 10.2 mg of 
purified hGCSF was obtained from a 500 mL culture of MBP- 
hGCSF. This total yield of 38.3% (Table 2) was lower than from 
PDIb'a'-hGCSF. After treatment with Triton X-114, the level of 
endotoxin in the purified hGCSF sample was 0.013 EU/ng. The 
endotoxin level of bio-products is typically less than 1 EU/ Hg. 

Biological activity of hGCSF 

The bioactivities of the purified hGCSF proteins were measured 
using an MTT assay and the mouse M-NFS-60 myelogenous 
leukemia cell line. The number of M-NFS-60 cells increased 
dramatically after incubation with commercially available hGCSF 
or hGCSF purified from tiie PDIb'a'-hGCSF or MBP-hGCSF 
fusion proteins (Figures 5A-E). At concentrations below 1 nM, the 
dose-response curves were sigmoidal for all three forms of hGCSF 
(Figure 5E); however, higher concentrations produced mild 
inhibition, resulting in a bell-shaped curve (Figure 5E). The 
EC50S of commercial hGCSF, hGCSF from MBP-hGCSF, and 
hGCSF from PDIb'a'-hGCSF were 10.69±2.62 pM, 
2.83±0.31 pM, and 3.38±0.41 pM, respectively, with HiU 
coefficients of 1.06±0.29, 1.00±0.05, and 1.06±0.11, respective- 
ly. The differences between the ECsoS and Hill coefficients were 
not statistically significant, suggesting that the hGCSF proteins 
purified from MBP-hGCSF and PDIb'a'-hGCSF are as slightiy 
better effective as commercially available hGCSF. 

Discussion 

Many human proteins expressed in prokaryotes such as E. coli 
are prone to accumulation in IBs. Consequently, time-consuming 
solubilization and refolding are necessary to generate the purified 
proteins; processes that are also hampered by low yields, poor 
reproducibility, and the generation of proteins with low biological 
activity [23,37]. When expressed in E. coli, hGCSF is also 
insoluble, and so to address this problem, this study examined the 
effect of seven dififerent fusion tags that function as chaperones, as 
well as the effect of a low expression temperature, on the solubility 
of hGCSF. 

The MBP, PDI, PDIb'a', and NusA tags solubilized greater 
than 70% of the hGCSF fusion protein at 30°C, whereas the 
solubilities of the Trx-, GST-, and His6-tagged proteins were low 
at this temperature (Table 1, Figure 2). MBP is thought to act as a 
general molecular chaperone [38] by binding to hydrophobic 
residues present on protein surfaces [39]. MBP-tagged proteins 
can be easily purified with commercially available MBP-binding 
columns. PDI forms and breaks disulfide bonds of proteins in the 
lumen of the endoplasmic reticulum. The cytoplasm is usually a 
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His Trx GST PDIb'a' 



MBP 



PDI 



NusA 




His Trx 



GST PDIb'a' 



MCTPSTPST PSTPS 



MBP 



PDI NusA 



MCT PS TPST PS 




Figure 2. Expression levels of hGCSF fused with seven different tags in £ (ro//BL21(DE3). Protein expression was induced with 1 mM IPTG 
at either 18 C (A) or 30°C (B). After sonication, 20 |ig of each total protein was loaded onto a 10% Tris-tricine gel. The arrows indicate the hGCSF 
fusion proteins. M, molecular weight size marker; C, total protein before IPTG induction (control); T, total protein after IPTG induction; P, protein in the 
cell pellet after sonication; S, protein in the supernatant after sonication. 
doi:1 0.1 371 /journal.pone.0089906.g002 



reducing environment that prevents proper disulfide bond 
formation, but PDI increases tlie production of soluble proteins 
in both the cytoplasm [40] and periplasm of £. coli [41]. PDI is 
composed of four thioredoxin-like domains, named a, b, b', and a'. 
The a and a' domains display redox-active catalytic and 
chaperone activities, whereas the b and b' domains only 
demonstrate some chaperone functions [42] . Previous experiments 
in our laboratory have shown that PDIb'a' increases the solubility 
of several proteins to the same degree as PDI (data not shown); 



however, the data presented here show that PDIb'a' was less 
effective than PDI at solubHizing hGCSF. NusA was suggested as a 
solubUizing tag protein based on the revised Wilkinson-Harrison 
solubility model [43,44], which predicted NusA to be 95% soluble 
and to improve the solubility of several proteins. PDI and PDIb'a' 
were also predicted to be good solubilizing agents according to this 
model (data not shown). The revised WiUdnson-Harrison solubility 
model considers the number of four turn-forming residues (Asn, 
Gly, Pro, and Ser) and determines the net charge by subtracting 



Table 1. 


Expression levels and solubilities of hGCSF fused with 


seven different N-terminal tags. 










Tag 


Tag size (kDa) 


Fusion protein size 

(kDa) Expression (%) 


Solubility (%) 












18°C 


30°C 


18°C 


30°C 


hGCSF (18.J 


! kDa) His6 


0.8 


23.5 


43.8 


33.6 








Trx 


11.8 


35.3 


61.4 


48.8 


98.3 


5.0 




GST 


25.7 


49.2 


41.3 


40.0 


78.4 


3.2 




PDIb'a' 


35.6 


59.1 


66.3 


42.2 


96.0 


73.5 




iVlBP 


40.3 


63.8 


61.4 


58.4 


96.5 


88.1 




PDI 


55.1 


78.7 


55.6 


43.8 


98.1 


89.3 




NusA 


54.9 


78.4 


68.0 


44.8 


97.5 


89.5 



doi:l 0.1 371 /journal.pone.0089906.t001 
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Figure 3. Purification of liGCSF from the PDIb'a'-KiGCSF fusion protein expressed in £ C£i//BL21(DE3). A. Overview of the purification 
process using IMAC chromatography and TEV protease digestion. B. SDS-PAGE analysis of hGCSF at various stages of the purification process. Ten 
micrograms of total protein was loaded onto a 10% Tris-tricine gel. iVl, molecular weight marker; lane 1, total protein before IPTG induction; lane 2, 
total protein after IPTG induction; lane 3, soluble fraction after cell sonication; lane 4, PDIb'a'-hGCSF fusion protein purified by the first round of IMAC 
(59.1 kDa); lane 5, PDIb'a'-hGCSF fusion protein following TEV protease cleavage showing separated PDIb'a' (35.6 kDa) and hGCSF (18.8 kDa); lane 6, 
purified hGCSF after the second round of IMAC (1 8.8 kDa). C. IMAC chromatogram of the TEV protease cleaved PDIb'a'-hGCSF fusion protein showing 
clear separation of the PDIb'a' tag and hGCSF. D. Silver staining of 7.5 \ig of purified hGCSF. M, molecular weight marker; R, reduced hGCSF; NR, non- 
reduced hGCSF. 

doi:1 0.1 371 /journal.pone.0089906.g003 

the number of acidic residues from the number of basic residues. (data not shown), despite the fact tliat hGCSF fu.sed with these tags 
However, this model may have some limitations because it showed good solubility. 

predicted relatively low solubility for the MBP, Trx, and GST tags With the exception of His6-hGCSF, lowering the expression 

temperature from 30°C to 18°C increased the solubility of all 



Table 2. The characteristics of hGCSF purified from PDIb'a'-hGCSF and MBP-hGCSF fusion proteins expressed in £ coli. 





Purification step 


hGCSF purified from PDIb'a'-hGCSF 






hGCSF purified from MBP-hGCSF 






Total proteln(mg) 


Purity (%) 


hGCSF(mg) 


Yield (%) 


Total protein(mg) Purity (%) hGCSF (mg) 


Yield 1%) 


Cell weight 


1500 








1500 




Supernatant 


140 


69.1 


30.8 


100 


118.8 75.9 26.6 


100 


1^^ Chromatography 
(IMAC/MBP) 


71.5 


73.3 


16.7 


54 


79.8 88 20.7 


77.8 


2"^ Chromatography 


11.4 


99 


11.3 


36.7 


10.3 99 10.2 


38.3 
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Figure 4. Purification of hGCSF from the MBP-hGCSF fusion protein expressed in £ co// BL21 (DEB). A. Overview of the purification 
process using an MBP affinity column, TEV protease digestion, and IMAC. B. SDS-PAGE analysis of hGCSF at various stages of the purification process. 
M, molecular weight marker; lane 1, total protein before IPTG induction; lane 2, total protein after IPTG induction; lane 3, soluble fraction after cell 
sonication; lane 4, MBP-hGCSF fusion protein purified using an MBP affinity column (63.3 kDa); lane 5, MBP-hGCSF fusion protein after cleavage with 
TEV protease showing separated MBP (40.3 kDa) and hGCSF (18.8 kDa); lane 6, purified hGCSF after IMAC (18.8 kDa). C. IMAC chromatogram of the 
TEV protease cleaved MBP-hGCSF fusion protein showing clear separation of the MBP tag and hGCSF. D. Silver staining of 7.5 |jg of purified hGCSF. M, 
molecular weight marker; R, reduced hGCSF; NR, non-reduced hGCSF. 
doi:1 0.1 371 /journal.pone.0089906.g004 



tagged hGCSF proteins (Table 1, Figure 2). Low expression 
temperatures have been successfully used in the past to increase 
the solubility of many proteins expressed in E. coli [45-50]; 
however, the molecular mechanisms responsible for this effect are 
not fuUy understood at present. The cold temperature protein 
chaperones are induced at low temperatures [51]; peptidyl-prolyl 
isomerase is a known cold temperature protein chaperone that 
catalyzes cis/trans isomerization of the peptide bonds found in 
proline residues [52]. In addition, several ATP-consuming heat 
shock proteins may also play a role in improving protein solubility 
at low expression temperatures [53]. Although highly inducible by 
heat shock treatment, these proteins are expressed at normal 
temperatures and have chaperone functions. However, the effects 
of lowering the expression temperature on protein solubility 
cannot be generalized because His6-tagged hGCSF was not 
soluble at all at 18°C. 

The effects of hGCSF purified from MBP-hGCSF or PDIb'a'- 
hGCSF on the proliferation of M-NFS-60 cells were slightly higher 
than that of commercially available hGCSF (Figure 5E). The EC r,,, 
values for hGCSF purified from MBP-hGCSF (2.83 pM) and 



PDIb'a'-hGCSF (3.38 pM) were consistent with a previous study 
that reported an ECso value in the range of 0.8-6 pM for hGCSF 
[25,54,55]. At high concentrations, the purified hGCSF proteins 
induced mild inhibition of cell proliferation, resulting in a bell- 
shaped biphasic dose-response curve (Figure 5E). This is consistent 
with a previous report that other cytokines also show a biphasic 
dose-response curve [56]. 

There are three splicing variants of hGCSF. The short isoform 
(b) used in this study is reportedly more active than the longer 
isoform (a) [57], and the third isoform lacks the region spanning 
amino acids 37 to 73. In this study, we substituted the first amino 
acid (Ala) with Met, and this mutation increased binding of 
hGCSF to its receptor [58] and facihtated PEGylation of the N- 
terminus of the protein, which increased the half-life of GCSF in 
blood [59]. 

Mature hGCSF contains five cysteine residues, four of which 
form two native intramolecular disulfide bonds, Cys' -Cys ' and 
Cys*''-Cys' A previous study in which Cys"* was mutated to Ser 
demonstrated that Cys"* is not required for bioactivity of hGCSF 
[60]. However, during folding of hGCSF, intermolecular disulfide 
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Figure 5. Cell proliferation assay of purified hGCSF using the M-NFS-60 cell line. Light microscopy images of IVI-NFS-60 cells incubated 
without (A) or with (B-D) 1 ng/mL hGCSF for 48 h. Commercially available hGCSF (B), hGCSF purified from IVlBP-hGCSF (C), and GCSF purified from 
PDIb'a'-hGCSF (D) were used. The scale bar represents 100 |im. E. Dose-response curve of M-NFS-60 cells following exposure to different 
concentrations of purified hGCSF and commercial hGCSF. The number of cells was measured as the OD570. •, commercially available hGCSF; A, 
hGCSF purified from MBP-hGCSF; hGCSF purified from PDIb'a'-hGCSF. Data are represented as the mean ± SE of naS of 2 independent experiments. 
Statistical significance compared to no hGCSF treatment group: * p<0.05; ** p<0.01; ***, p<0.001. 
doi:10.1371/journal.pone.0089906.g005 



bonds between two Cys residues or Cys and another Cys 
residue can occur in aggregates [61]. The formation of subsequent 
dimers or multimers can render hGCSF insoluble in E. coli 
cytoplasm. As a result of the non-optimal spatial orientation of the 
molecules, the activity of the GCSF dimer is much lower than that 
of the GCSF monomer in vitro [62]. Some effective solutions, such 
as the mutation of Cys'" [21,36] or the addition of a specific 
secretory signal peptide that directs the secretion of hGCSF into 
the periplasmic space [24], have been used to overcome this 
obstacle in E. coli. Here, soluble monomeric hGCSF with 
bioactivity similar to that of hGCSF purified from HEK cells 
was obtained using a fusion protein strategy and a low expression 
temperature. 



Mature hGCSF is glycosylated at Thr . One limitation of 
using E. coli to produce hGCSF is the lack of glycosylation 
machinery in the bacterial cells; therefore, overexpressed hGCSF 
obtained from E. coli is non-glycosylated. Glycosylation prevents 
protein aggregation and increases the half-life of circulating 
proteins in the blood by protecting proteins from protease 
cleavage; however, it does not affect the binding of proteins to 
receptors. Indeed, the clinical effects of glycosylated and non- 
glycosylated hGCSF on chemotherapy-induced neutropenia were 
not significantly different statistically in a clinical trial [63] . 
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Conclusion 

This study demonstrates that fusion proteins and a low 
expression temperature can be used to successfully express soluble 
hGCSF in the cytoplasm of £. coli. Using simple chromatographic 
techniques and TEV protease digestion, >10mg of highly 
bioactive hGCSF was purified from 500 mL cultures of cells 
expressing MBP-hGCSF or PDIb'a'-hGCSF. 

Supporting information 

Figure SI Expression levels and solubilities of hGCSF 
fused with seven different tags in E. coli Origami 
2(DE3). Protein expression was induced with 1 mM IPTG at 
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